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ABSTRACT 

The DNA-binding specificity and affinity of the 
dimeric human transcription factor (TF) STAT1, 
were assessed by total internal reflectance fluores- 
cence protein-binding microarrays (TIRF-PBM) to 
evaluate the effects of protein phosphorylation, 
higher-order polymerization and small-molecule in- 
hibition. Active, phosphorylated STAT1 showed 
binding preferences consistent with prior character- 
ization, whereas unphosphorylated STAT1 showed a 
weak-binding preference for one-half of the GAS 
consensus site, consistent with recent models of 
STAT1 structure and function in response to phos- 
phorylation. This altered-binding preference was 
further tested by use of the inhibitor LLL3, which 
we show to disrupt STAT1 binding in a sequence- 
dependent fashion. To determine if this sequence- 
dependence is specific to STAT1 and not a general 
feature of human TF biology, the TF Myc/Max was 
analysed and tested with the inhibitor Mycro3. Myc/ 
Max inhibition by MycroS is sequence independent, 
suggesting that the sequence-dependent inhibition 
of STAT1 may be specific to this system and a useful 
target for future inhibitor design. 

INTRODUCTION 

Transcriptional regulation in eukaryotes is complex (1,2) 
and regulated by processes as diverse as the translocation 
of transcription factors (TFs) into the nucleus (3) and ex- 
pansion of compacted DNA by chromatin remodeling 
factors. TFs play an essential role by directing RNA poly- 
merase complexes to gene targets. Understanding the 



combinatorial association of TFs with preferred DNA se- 
quences, the cistrome (4) of the cell, is an ongoing chal- 
lenge for molecular biology. Strategies such as chromatin 
immunoprecipitation coupled to microarray (ChlP-chip) 
(5) or high-throughput sequencing (ChlP-seq) (6) have 
provided novel insights into genome-wide association pro- 
files. Similarly, the binding preferences of large numbers 
of TFs have been identified using protein-binding micro- 
arrays (PBMs) (4,7,8). However, the next generation of 
such studies will need to embrace the distinction that 
TFs rarely act in isolation in vivo. Indeed, it is generally 
thought that the combinatorial assembly of TFs provides 
much of the complexity inherent to genetic networks 
(9-11), and studies of multi-protein complexes, such as 
the basal transcription initiation complex (10) and the 
interferon enhanceosome (12), reveal the functional diver- 
sity controlled by protein:protein interactions. 

Here we use our recently developed, microarray-based 
tool TIRF-PBM (total internal reflectance fluorescence 
protein-binding microarray) (13) to study the modulation 
of DNA-binding specificity of STATl, a member of the 
STAT (signal activator and transducer) family of human 
TFs, as well as that of Myc/Max, another well- 
characterized human TF. Recently, a paradigm shift in 
the understanding of STAT biochemistry has caused a 
necessary re-evaluation of the role of phosphorylation, 
dimerization and polymerization in the DNA-binding 
preference of the STATs (14). STATl exists as a func- 
tional dimer (15), and the STATs are heavily implicated 
in oncogenesis when aberrantly upregulated (16,17), often 
associated with leukemia, lymphoma and myeloma, 
although STATl is rarely the primary STAT active in 
these cases (17-19). However, the STATs also serve im- 
portant roles when constitutively active in normal cells, 
complicating strategies designed to interfere with their 
function (3,16,20). TIRF-PBM is well suited to address 
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fundamental questions about the multi-protein complexes 
involved in transcriptional regulation. 

We have evaluated several critical aspects of STATl 
biology, both of the activated, phosphorylated form, as 
well as the less active, unphosphorylated form. STATl is 
known to form larger assembhes than dimers when bound 
to DNA, including tetramers and extended polymers, 
which may significantly influence in vivo binding prefer- 
ences (14). We evaluated the effect on DNA binding with 
or without the presence of the N-terminal domain, 
required for STATl polymerization. Due to their critical 
roles in tumorigenesis, there has been great interest in 
finding ways to regulate TF function in ways that are 
specific to individual proteins (16). In this study, we 
evaluated the efficacy of several small molecule inhibitory 
compounds (21) to reduce DNA-binding affinity and to 
investigate the possibility of sequence-dependent effects in 
STATl or Myc/Max binding, which would serve as ideal 
targets for future drug discovery. 

MATERIALS AND METHODS 

DNA array preparation 

Ninety-six DNA sequences with known interactions with 
Myc/Max and STAT proteins in vivo and in vitro (22-25) 
or from promoter regions associated with the proteins in 
ChlP-chip assays (26-29) were selected, along with 
non-binding sequences as controls. dsDNA sequences 
were generated by primer extension of 5^ amino termi- 
nated, 51-mer template strands as previously described 
(13). Full DNA sequences are available in Supplementary 
Table SI. dsDNA-containing polyacrylamide-epoxide 
hydrogels were generated as previously described (13). 
The printed hydrogel spot morphology was evaluated in 
the fully hydrated and dry states. Swelled hydrogels with 
DyLight-649 and DyLight-549 labeled DNA controls 
were observed using phase contrast microscopy 
(Olympus ITX 70) and fluorescent confocal microscopy 
(Olympus Fluoview 500). Dry hydrogel spots were 
examined using scanning electron microscopy (SEM) 
with a JELO-X40 microscope at beam size 3, beam 
energy of 3-7 kV. Hydrogel samples were prepared for 
SEM imaging by Hummer 6.2 gold sputtering 
(Technics). Hydrogel characterization available in 
Supplementary Figure SI. 

Preparation of proteins 

Phosphorylated STATl (P-STATl), unphosphorylated 
STATl (U-STATl) and truncated STATl (STATltc) 
were prepared as described previously (15). c-Myc and 
Max isoform were expressed separately in E. coli as re- 
combinant, His-tagged proteins, then denatured and 
renatured together, as previously described (22). 
TATA-Binding Protein (TBP) was prepared as previously 
described (30). Purified proteins were fluorescently labeled 
with the amine-reactive dyes NHS-DyLight-649 and 
NHS-DyLight-549 (Pierce) and characterized as previ- 
ously described for TIRF-PBM (13). Final dye-protein 
conjugates were evaluated for DNA-binding ability via 
electrophoretic mobihty shift assay (EMS A) using 



P -labeled cognate DNA run on a 6% acrylamide gel at 
4°C in 0.5 X TBE for 2h at 200 V. EMSA was used to 
confirm the expected binding affinity for P-STATl on 
GAS cognate DNA, with U-STATl displaying a 
>200-fold decrease in binding affinity, as well as 
expected binding affinity for Myc/Max and TBP (data 
not shown). 

TIRF instrumentation 

TIRF experiments were conducted using a homebuilt in- 
strument to generate a uniform evanescent field across a 
plastic microscope slide printed with a microarray with 
temperature and flow rate control, described in previous 
work (13). 

Reaction conditions 

Care was taken to prevent non-specific association. To 
remove remaining reactive epoxide, a lOmM Tris-HCL 
pH 8, 10 mM ethanolamine, 0.1% SDS solution was 
added to the flow cell for 10 min at 37°C prior to trials. 
To block non-specific protein contacts, the instrument was 
washed with PBS, 5% w/v bovine serum albumin (BSA), 
1% v/v Tween-20 for 10 min at 37°C, then flushed with 
appropriate running buffer for the trial: Myc/Max 
running buffer (20 mM Tris-HCl, 60 mM KCl, 4% 
glycerol, 0.1 mg/ml BSA, pH 8.0), STATl running buffer 
(20 mM HEPES, 4% glycerol, 40 mM KCl, 40 mM CaCl2, 
2mM DTT, 0.2 mg/ml BSA, pH 8.0), or TBP running 
buffer (Ix PBS, 0.5% w/v BSA, 0.01% v/v Tween-20, 
5mM MgCy. Data collection used 20 s integrated expos- 
ures, at 25° C and a buffer flow rate of 50 |il/min. Running 
buffer with protein, or protein with inhibitor compound, 
was introduced and allowed to flow for >30 min. This was 
followed by the injection of running buffer without 
protein to induce dissociation of the TFs from DNA. 
After each trial, the flow cell was washed with Ix PBS, 
1 M NaCl, 1 % v/v Tween-20 to remove residual bound 
protein. 

Data normalization and analysis 

Each sequence was spotted on 5-10 repHcate spots per 
array, and spot intensities for each sequence were 
quantified and used for all subsequent analyses of 
affinity and equilibrium-binding intensity, as described 
previously (13). Logos in Table 1 were generated as pre- 
viously described (13). 

Inhibitor compound preparation and analysis- 
Inhibitors of STAT3 SH2 domain function (and presum- 
ably functional against closely related STAT SH2 
domains) LL3 (21), FLLLl, FLLL12 and FLLL31 
(Tom Li, Ohio State University) were suspended in 
DMSO at 5mM, then added to TIRF-PBM trials with 
50 nM P-STATl protein. For LLL3, final concentrations 
ranged from 5 |iM to 200 |iM. For FLLLl, FELL 12 and 
FLLL31, a final concentration of 20|iM was used. 
Mycro3, a small molecule inhibitor of Myc/Max dimer- 
ization, was suspended at 5mM in DMSO and used at 
a concentration of 50|iM with Myc/Max. TIRF-PBM 
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assays were conducted as above, and decreases in equilib- 
rium binding intensity by inhibitor was assessed 
(Figure 2). For LLL3, equilibrium-binding intensity at dif- 
ferent concentrations was used to derive IC50 values. 
STATl sequences were binned into three categories: 
'High Affinity' sites, sequences with a consensus STATl 
paHndromic GAS-binding site with a three nucleotide 
spacing between half elements; 'Half Sites', sequences 
with half of the paHndromic GAS motif; and 'Poor 
Affinity' sites, sequences with either no GAS site or a 
GAS site with improper spacing between palindromic 
elements. For Myc/Max, sequences were binned into two 
categories: 'High affinity', sequences with consensus 
E-box binding DNA, and 'Low affinity', sites with 
mutated or absent Ebox consensus sites. Inhibition 
across these clusters of sequences was compared using 
pair-wise two-tailed Student's ^-test with Welsh's 
correction. 

RESULTS 

Activated dimeric states are required for efficient DNA 
binding by STATl and Myc/Max 

Prior efforts have shown that the P- STAT 1/P- STATl 
homodimer and Myc/Max heterodimer are functionally 
distinct from the U- STAT 1/U- STATl homodimer and 
Myc alone, respectively (31,32). For instance, 
phophorylation at tyrosine 701 on each STATl 
monomer allows them to orient and form the functional 
anti-parallel dimer (32). Recent work has shown that 
U-STATl is not a monomer, but exists in a distinct 
parallel dimeric orientation, with the N-terminal 
domains serving as the dimerization interface rather 
than the SH2 domains, as in the P-STATl dimer (32). 
A truncated form of STATl (STATltc) that lacks this 
N-terminal domain and a short C-terminal transcription 
activation domain (expressing amino acids 132-712) was 
included to probe this behavior. The binding specificity of 
the dimeric Myc/Max complex and the specificity of Myc 
alone were estabhshed to provide comparison during in- 
hibitor trials. Our full set of proteins were individually 
evaluated for binding preference across the DNA 
sequence set, with TIRF-PBM assays used to generate 
equiUbrium binding intensity values, using real-time meas- 
urements of binding (Figure lb) captured to ensure that 
equiUbrium was reached. These normalized intensity 
values show that each active dimer has a distinct set of 
highly preferred sequences (Figure la) which generally 
map to sequences that contain a full or single-base 
mismatch consensus site for the protein. 

P-STATl, the functional dimer, displays strong 
sequence preference for consensus binding sites. Among 
its high-affinity binding sites, we find the highest binding 
from three probes (sequences 70-72) that have optimal 
GAS site consensus [characterized by a DNA sequence 
of TTCN2.4GAA(24,33)]. The most preferred site 
(sequence 70) possesses an interior spacer with sequence 
CCG, which matches the STATl consensus site in the 
JASPAR core database (28) (see Table 1 and Discussion 
section below). P-STATl showed uniformly low binding 



at 'deviant GAS sites' that lack a full consensus site. 
STATl prefers binding sites that have an intra-site 
spacer of three bases; other STAT family member 
proteins prefer to bind at sites with spacers from 2 bases 
to 4 bases (24,34), and we find that the majority of 
highequilibrium binding intensity interactions occur at 
sites with a 3 base spacer. Noticeably, sites with a 4 base 
spacer (e.g. sequences 28, 32 and 36) display very low 
binding, reflecting a greatly reduced specificity. The vari- 
ability in binding intensity of P-STATl among GAS sites 
may reflect a dependence on spacer and flanking sequence, 
but agrees with the consensus site as the strongest deter- 
minant of specificity. 

The binding of the truncated form of P-STAT 
(P- STATltc) showed overlap of binding sites with the 
full-length protein, but also displayed a large number of 
drastic changes in binding specificity; the truncation did 
not appreciably bind at some consensus sites well- 
occupied by the full-length protein, and bound well at 
some sites with no apparent GAS consensus targets. 
These differences suggest that the N-terminal domain 
may have a functional role in determining site specificity 
of the active dimeric protein. 

Similarly to STATl, the presence of an E-box (a known 
consensus motif for Myc/Max binding, sequence CACGT 
G) is the largest determinant of binding intensity 
(Table 1), consistent with prior in vitro and cell-based 
motif models in the JASPAR motif core database (28). 
As expected, on sites with no consensus binding site, the 
binding of Myc/Max is greatly diminished. Two excep- 
tions involving highly repeating DNA are evident, with 
high binding intensity for DNA with a 12 adenine repeat 
(sequence 7) as well as modest binding for DNA with a 12 
guanine repeat (sequence 34) (35). These observations 
support the model of Myc/Max binding specificity 
driven primarily by interactions with the E-box. 

In contrast to the binding of the active dimers, the 
binding of U-STATl, U-STATl tc or Myc show very 
low binding and reduced specificity for consensus DNA 
(Figure la). This does not necessarily reflect a transition 
from active dimer to inactive monomer, as both U-STATl 
and Myc are able to form homodimers. Historically, 
STAT phosphorylation was thought to be required for 
activation (36), but recent work suggests that even the 
very weak-binding properties of U-STATl may have func- 
tional consequence in the cell, with U-STATl existing 
in an equilibrium between monomeric, anti-parallel 
dimeric (DNA-binding deficient) and a minority in the 
parallel (DNA binding) form (32). The weak specificities 
displayed by Myc are Hkely not biologically relevant, as 
Myc forms homodimers only at concentrations far in 
excess of that found in most cells (37), although Myc 
displayed greatest specificity to sequences 13 and 17, 
both of which contain the CACGTGG preferred consen- 
sus binding site. 

Quantitative affinity and specificity data for STATl 
and Myc/Max 

In prior work, we estabhshed that binding intensities in 
TIRF-PBM are a direct indication of binding affinity (13). 
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Figure 1. Analysis of the binding specificity of human TFs STATl and Myc/Max across DNA sequences containing cognate, mutant or no 
consensus binding sites in TIRF-PBM. (a) Equihbrium binding fluorescence intensities of Dy Light 649 (used to label each protein) for each condition 
are shown in a spectrum from lowest (violet) to highest (red) signal (gray indicates poor quahty data excluded from analysis). Colored bars on left 
separate sequences into regions of interest, and show consensus identities for all sequences in that region, (b) Insets show examples of binding 
equihbrium data displaying association and dissociation regimes for P-STATl and Myc/Max used to generate the equihbrium binding intensities. 
Each trace shows binding to a different sequence on the array, (c) The addition of a 10-fold molar excess of U-STATl to P-STATl does not change 
the binding preferences of P-STATl. Representative examples shown from experiments where P-STATl, 25 nM, with (light colors) or without (dark 
colors) a 20min pre-incubation with 250 nM U-STATl, was tested across an array of STAT recognition sequences. No significant differences in 
binding preference or affinity were observed. 
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To further probe this relationship, the real-time binding 
behavior and kinetics of P-STATl and Myc/Max across 
the TIRF-PBM array were measured (representative data 
in Figure lb). The time-dependent binding intensity 
change observed is analogous to the association and dis- 
sociation traces common in surface plasmon resonance 
(SPR) measurements, and TIRF-PBM conditions have 
been optimized to avoid common pitfalls of these tech- 
niques (i.e. mass transport limitations) (13). We 
compared the overall observed binding affinity, Kj^, on a 
consensus binding site in TIRF-PBM with comparable 
measurements from the literature. P-STATl and Myc/ 
Max have affinity constants that are accurate and 
precise compared to prior measurements (Table 1). For 
example, P-STATl in TIRF-PBM on an optimal GAS 
site (sequence 70) displays an affinity constant of 
3.4±0.1nM, consistent with prior measurements (15), 
and the logogram of high-affinity binding sites (consensus 
motif TTCYVVGAA) reflects good agreement with 
known binding motifs in JASPAR (28) and 
TRANSFAC (P-value 0.00074) (38). The truncated form 
of STATl, P-STATl tc, shows a divergent binding pattern 
from the full-length protein (Figure la), but retains high- 
affinity (7.9 nM) binding at a GAS consensus site 
(sequence 51) and a logogram that is consistent with 
normal STATl binding (Table 1). 

Myc/Max displayed a K^^ of 0.9 ± 0.2nM for its 
cognate DNA, consistent with the sub-nanomolar 
affinity values previously reported (37,39), and a binding 
logogram very similar to the motif from the JASPAR 
database (Table 1) (28). The JASPAR core Myc motif is 
derived from ChlP-seq data (40), and the tight correlation 
(P-value of 4.9e-05) between our observed motif and the 
Chip derived motif suggests that PBMs are non-disruptive 
of biologically relevant patterns of interaction. 

U-STATl, U-STATltc and Myc display affinity con- 
stants for consensus DNA that are much weaker than 
that observed for the active forms. U-STATl displays 
weak affinity to optimal GAS site DNA (T^d of 
>400nM), over a 100-fold impacted from P-STATl 
binding, consistent with prior observations (32), and re- 
flective of the disruption of the proper SH2 domain- 
mediated protein:protein interface that allows the 
dimer's DNA-binding interface to form. Intriguingly, 
prior reports have shown that U-STATl can form a 
homodimer in vitro through a different interacting 
surface in the N-terminal region, leading to an anti- 
parallel dimeric form (32). Here, U-STATl, despite its 
very low binding, displays a binding logogram showing 
preference for the 5^ half of the palindromic GAS consen- 
sus site, which agrees well with prior cell-based work that 
shows U-STATl can activate gene expression on only the 
5^ end of the palindromic GAS site (41). To further inves- 
tigate this behavior, we compared U-STATl binding to 
the binding of U-STATl tc, which lacks the region respon- 
sible for the anti-parallel conformation and thus should be 
unable to form dimers. U-STATltc shows neghgible 
binding in this study, but the weak binding shows a 
weak preference for the TTC region of half of the GAS 
consensus site, suggesting that monomeric STATl may 
exhibit binding to the GAS consensus site, although the 



affinity for these interactions is Hkely below the threshold 
of physiological importance. For Myc alone, binding 
affinity was also impacted compared to the Myc/Max 
heterodimer, with a measured of >l|iM, over a 
1000-fold loss in affinity. This drop in affinity is consistent 
with expectations (37), and reflects the poor ability for 
Myc/Myc homodimers to efficiently bind DNA. 

These results show that TIRF-PBM accurately captures 
affinity constants for the human TFs STATl and Myc/ 
Max, and offers a route to explore binding differences 
over a wide range of affinity. The range of observed 
affinities between highly preferred and un-preferred 
targets reflects the selectivity of the protein, and we find 
almost three orders of magnitude change in affinity for 
these human TFs (Table 1, range of X^d^^^ observed). 
This reflects the transition from weak or non-specific 
TF:DNA interactions to highly specific, high-affinity 
binding, an important range to capture, as recent reports 
suggest that both types of interactions may play a role 
in vivo (4). 

STATl-binding preference is modulated by polymerization 

In the cell, STATl is in a constant flux between 
phosphorylated and unphosphorylated states, with signifi- 
cant populations of both states co-existing (32). As 
U-STATl and P-STATl simultaneously exist in vivo, we 
investigated the effect of dimerization and higher-order 
polymerization on STATl DNA binding. 

First, we considered that U-STATl and P-STATl may 
interact in the cell, giving rise to unique specificity. 
Estimates have placed the total concentration of STATl 
in the cell at ~50nM, and so to ensure an observable 
effect we tested the binding of 25 nM P-STATl in the 
presence or absence of a 10-fold molar excess of 
U-STATl. The STATl proteins were mixed and 
incubated for 1 hour, to ensure that if dimer dissociation 
was the limiting step, ample time to dissociate and 
re-associate as hemi-dimers of P-STATl and U-STATl 
was possible. However, even under these permissive con- 
ditions, we observed unchanged P-STATl -binding speci- 
ficity and affinity (representative measurements in 
Figure Ic), suggesting that STATl does not form hemi- 
dimers composed of one monomer of U-STATl and one 
monomer of P-STATl. 

Next, we tested an alternative model that homo-dimers 
of P-STATl and homo-dimers of U-STATl polymerize 
into tetramers and higher-order polymers [which have 
been observed for P-STATl alone (32)], and this polymer- 
ization may change P-STATl -binding preference. To 
address this possibility, we performed an experiment 
where we first allowed 50 nM P-STATl to bind to an 
array and reach equiUbrium. This was followed by 
10-fold excess of U-STATl, and the binding of 
U-STATl to the pre-bound P-STATl :DNA array was 
monitored. However, under these conditions we 
observed no U-STATl binding to the array (data not 
shown). As we observed minimal binding of U-STATl 
alone, even at 400 nM (Figure 1), this result shows that 
P-STATl does not have sufficiently strong interactions 
with U-STATl dimers to cause mixed polymerization. 
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Figure 2. The inhibition of STATl-DNA complexes but not Myc/Max-DNA complexes show sequence-dependent effects, (a) Inhibitor compound 
LLL3 and analogs were incubated with P-STATl and affect equihbrium binding intensity of P-STATl to DNA in a sequence-dependent manner 
(20|iM compound and 50 nM P-STATl). IC50 for LLL3 was shown to vary in a sequence-dependent manner (Supplementary Figure S2) and ranges 
from ~1 |iM to >60|iM. ***P-value < 0.001 for optimal GAS site affect to half GAS sites or deviant GAS sites by Student's /-test.(b) Inhibitor 
compound Mycro3 was incubated with Myc/Max and affects equilibrium binding intensity in a sequence-independent manner, (c) and (d) 
Representative data traces showing that the addition of the protein:protein inhibitors cause decreased TF binding. 



consistent with the understanding that STATl polymer- 
ization is driven by N-terminal domain contacts, which 
are occluded in U-STATl anti-parallel dimers. These 
findings support a model for STATl behavior where 
P-STATl and U-STATl do not significantly interact. 

However, tetramerization and higher-order polymeriza- 
tion of P-STATl may still have important effects on 
sequence preference. P-STATltc is missing the 
N-terminal domain predicted to be utilized in these poly- 
merization processes (32), providing an important means 
by which we can investigate the role of higher-order poly- 
merization. We find that P-STATl tc has reduced binding 
compared to P-STATl at preferred binding sites, with an 
average 2.6-fold impact on binding, and thus the 
truncated regions appear to impart a measure of affinity 
and specificity to the binding, possibly through facilitating 
polymerization. The mechanism for the affinity imported 
by this presumptive polymerization remains unclear at 
present, but intriguingly, we find that P-STATl tc binds 
at a number of sites with low sequence homology to the 
GAS consensus, suggesting more complex roles for 
polymerization. 

Small molecule inhibitors of STATl and Myc/Max 
display sequence-specific effects. P-STATl at a concentra- 
tion of 50 nM was tested with 20 |iM concentrations of the 
compound LLL3 and several LLL3 analogs (FLLLl, 
FLLL12 and FLLL31) (21) (Figure 2a, c). LLL3 has 
been previously characterized as a STAT3 dimerization 
inhibitor (21), but the high conservation among proteins 
in the STAT family (42) strongly suggested the possibility 
of an effect on STATl. LLL3 had the greatest inhibitory 
effect on 'High Affinity' sequence binding of the analogs 
tested, although all reduce equilibrium binding intensity 
by a significant extent. Across all 'High Affinity' sites, 
LLL3 shows an average adjusted binding intensity of 
12.6 ± 4.9% of the no-inhibitor control, nearly an 8-fold 
reduction in binding. In comparison, the analogs display 
an average adjusted binding intensity of 27.9 ± 5.6%. In 
previous work with STAT3, LLL3 was shown to reduce 
DNA binding to ~40% of normal activity at this 



concentration, but those assessments were performed in 
cells and in vivo (21), so the higher activity may be 
attributed to the presence of other factors that could par- 
tially rescue the disrupted interaction. 

The STAT dimerization interface contributes to the for- 
mation of the DNA-binding interface (36), and we 
investigated whether STATl dimerization inhibition 
would show differential effects against weaker affinity 
sites as well as high affinity, preferred binding targets. 
Unexpectedly, LLL3 and its analogs display markedly 
less perturbation of the low-affinity interactions 
(Figure 2a). At 'Half Sites', LLL3 showed 59.4 ± 15.5% 
adjusted binding intensity, while the analogs tested 
showed 51.5 ± 7.2% intensity. LLL3 thus shows a 
4.7-fold greater reduction in binding at 'High Affinity' 
sites compared to the perturbation of binding at lower 
affinity 'Half Sites', with the LLL3 analogs following a 
similar pattern. 'Poor Affinity' sites, similarly, showed 
42.4 ±13.1% intensity when tested with LLL3, and 
55.9 ± 10% average adjusted intensity with the other 
LLL3 analogs. The difference in inhibitory effect 
between 'High Affinity' sites and 'Half Sites' or 'Poor 
Affinity' is statistically significant, with a 
P-value < 1.0 X 10~^, suggesting that LLL3^s inhibition 
may function in a sequence-dependent manner. It is 
possible that the STATl dimer binds to 'high affinity' 
sites in a specific conformation of the DNA-binding 
domain and the SH2 dimerization domain that is readily 
disrupted, while lower affinity interactions with DNA pri- 
marily depend on weakly specific interactions that are not 
as sensitive to protein conformational changes. To explore 
this behavior, LLL3 inhibition was tested from 5|iM to 
200 |iM, allowing generation of IC50 dose-response curves 
for the binding of P-STATl at each sequence 
(Supplementary Figure S2). Each sequence showed a 
unique IC50, ranging for 1 |iM to >60|iM, supporting 
the hypothesis that LLL3 disruption of STATl DNA 
binding occurs in a sequence-dependent fashion. 

It was not apparent if this sequence-specific inhibition 
could be a general feature of dimeric TF interactions or 
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specific to the protein, so we pursued Myc/Max inhibition 
studies using Mycro3, a compound previously 
characterized as a selective inhibitor of Myc/Max dimer- 
ization (43). 50 |iM Mycro3 [comparable to the reported 
IC50 of 40 ± 13|iM obtained with a cognate-binding site 
(43)] incubated with lOnM Myc/Max showed a reduction 
in binding across the array (Figure 2b). Mycro3 caused an 
observed change in Myc/Max binding intensity at 'high 
affinity' sites from 100% to 68.5 ± 10.6% signal, and a 
statistically indistinguishable change to 66.2 ± 23.7% 
binding signal at weaker affinity sites. This decrease in 
Myc/Max binding also affected the kinetics of the reaction 
(Figure 2d), and increased the K^) observed for cognate 
site binding from 0.9 ± 0.2 nM to 3.6 ± 1.4nM. These 
results establish that Mycro3^s affect on Myc/Max 
occurs in a sequence-independent fashion, consistent 
with its proposed disruption of the formation of the 
binding-competent heterodimer complex (44), and 
suggests that the interactions of Myc or Max with DNA 
do not significantly alter this inhibition. As this Myc/Max 
compound study did not reveal sequence-dependent inhib- 
ition, the sequence-specific inhibition observed with 
STATl and LLL3 appears to be specific to the TFs and 
compounds tested. 



DISCUSSION 

This TIRF-PBM study of the full length, dimeric, human 
TF STATl demonstrates the substantial variation in 
DNA sequence preference caused by STATl phosphoryl- 
ation, multimerization capability, and in the presence of 
small molecule inhibitors of dimerization. TFs in 'active' 
states (i.e. the tyrosine-phosophorylated homodimer of 
STATl or the heterodimeric complex of Myc and Max) 
display binding specificity and affinity in agreement with 
prior work, although neither the binding motif in 
JASPAR nor our single-protein TIRF-PBM studies recap- 
itulate the diversity of STATl -binding sites seen in 
cell-based studies (29), a general trend in TF studies 
(4,45-47). 

Probing less active states (i.e. the unphosphorylated 
form of STATl or Myc protein alone) shows that 
DNA-binding affinity is reduced 100- or 1000-fold with 
non-cognate targets, yet reveals that there is distinct 
binding preference even among these poorly bound sites. 
These observations challenge a simpHstic model where in- 
dividual STATl dimers can efficiently bind at preferred 
binding sites and that STATl polymerization serves only 
to increase binding at poor binding sites. The mechanisms 
by which higher-order polymerization orients and directs 
STATl binding remain unclear, and the biological role of 
this polymerization could be reflected in the ability to 
utilize non-ideal binding sites topologically near other 
sites, a possibility not addressed in this work. However, 
our results suggest that the N-terminal region may play an 
important role in regulating site specificity, and future 
work should investigate this possibility with studies of 
polymerized state in relation to binding. 

More broadly, these results agree with other PBM 
efforts (4) that illustrate that protein: DNA interactions 



cannot be accurately viewed as a single, reductionist con- 
sensus site. Rather, protein binding must be viewed as a 
continuous scale, with motifs of high affinity (or even 
multiple, distinct motifs of high affinity) present among 
a sea of less preferred, but still possibly functional, 
motifs. In notable agreement, a recent wide-ranging 
report on the motifs for a broad set of DNA-binding 
domains from mouse TFs found numerous high, moderate 
and low-affinity sites for each TF tested (4). 
Communicating this wide specificity in a convenient 
form is an ongoing challenge; position weight matrixes 
generate logograms that help show the range of possible 
binding sites (28,48), but still fail to accurately convey in- 
formation about TFs that bind to multiple distinct motifs. 

The expansion of TIRF-PBM methods into the screening 
of small molecule inhibitors of protein: protein interaction 
(LLL3 for STATl and Mycro3 for Myc/Max) revealed the 
unexpected, but perhaps desirable, result that LLL3 
appears to disrupt the function of STATl in a DNA 
sequence-dependent manner. As aberrant STAT activity 
plays a key role in many cancers (19), yet also plays a cen- 
trally important constitutive regulatory role (36), finding 
sequence-selective inhibitors such as LLL3 may aid in the 
design of chemo therapeutics with reduced off- target effects. 
This highlights the proposed importance of protein:protein 
inhibitors as therapeutic targets (16,49), and suggests that 
the screening of such compounds to reveal sequence- 
dependent variations may be of value (50). 

This study supports the view that STATl DNA-binding 
affinity and specificity are modulated by the dimeric con- 
formation of the TF, and that disruption of normal dimer 
interfaces by small molecule inhibition or truncation of 
the N-terminal domain of STATl will result in altered, 
but not aboHshed, DNA binding. These observations are 
consistent with recent models of STAT biology (14,32), 
and with recent findings that suggest both preferred and 
non-preferred binding sites are important for the majority 
of TFs (4,7). The modulation of the functional occupancy 
of these sites in vivo is likely to rely on the effect of various 
co-activators, competing TFs and the basal transcriptional 
machinery (13,51). As TIRF-PBM has the ability to 
resolve multiple components of protein complexes that 
bind DNA (13), there is great potential for future 
studies of STATl in concert with co-activators that may 
reveal further functional consequence and provide a 
bridge between in vitro and in vivo characterization. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Table 1 and Supplementary Figures 1 
and 2. 
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